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60/189,680, filed on March 15, 2000 and U.S. Provisional Patent AppUcation No. 
60/207,700, filed on May 26, 2000. 

BACKGROUND 

The invention generally relates to wireless communication systems. In particular, the 
invention relates to joint detection of multiple user signals in a wireless conraiunication 
system. 

Figure 1 is an illustration of a wireless communication system 10. The 
communication system 10 has base stations 12i to 125 which communicate with user 
equipments (UEs) 14^ to 14^ Each base station 12i has an associated operational area where 
it communicates with UEs 14i to 14^ in its operational area. 

In some communication systems, such as code division multiple access (CDMA) and 
time division duplex using code division multiple access (TDD/CDMA), multiple 
communications are sent over the same frequency spectrum. These communications are 
typically differentiated by their chip code sequences. To more efficiently use the frequency 
spectrum, TDD/CDMA communication systems use repeating frames divided into time slots 
for communication. A communication sent in such a system will have one or multiple 
associated chip codes and time slots assigned to it based on the communication's bandwidth. 

Since multiple communications may be sent in the same frequency spectrum and at 
the same time, a receiver in such a system must distinguish between the multiple 
communications. One approach to detecting such signals is matched filtering. In matched 
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filtering, a communication sent with a single code is detected. Other communications are 
treated as interference. To detect multiple codes, a respective number of matched filters are 
used. Another approach is successive interference cancellation (SIC). In SIC, one 
communication is detected and the contribution of that communication is subtracted from 
the received signal for use in detecting the next communication. 

In some situations, it is desirable to be able to detect multiple conmiunications 
simultaneously in order to improve performance. Detecting multiple communications 
simultaneously is referred to as joint detection. Some joint detectors use Cholesky 
decomposition to perform a minimum mean square error (MMSE) detection and zero- 
forcing block equahzers (ZF-BLEs). These detectors have a high complexity requiring 
extensive receiver resources. 

Accordingly, it is desirable to have alternate approaches to multi-user detection. 

SUMMARY 

A time division duplex communication system using code division multiple access 
transmits a plurality of data signals over a shared spectrum in a time slot. A combined signal 
is received over the shared spectrum in the time slot. The plurality of data signals are 
grouped into a plurality of groups. The combined signal is matched filtered based on in part 
symbol responses associated with the data signals of one of the groups. Data from each data 
signal in the one group is jointly detected. An interference signal is constructed based on 
in part the one group detected data. The constructed interference signal is subtracted from 
the combined signal. Data from the other groups is detected by processing the subtracted 
signal. 

BRIEF DESCRIPTION OF THE DRAWING(S) 
Figure 1 is a wireless communication system. 

Figure 2 is a simplified transmitter and a receiver using joint detection. 
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Figure 3 is an illustration of a communication burst. 

Figure 4 is a flow chart of adaptive combination of joint detection and successive 
interference cancellation. 

Figure 5 is an illustration of an adaptive combination of joint detection and 
successive interference cancellation device. 

Figures 6-12 are graphs comparing the performance of adaptive combination of joint 
detection and successive interference cancellation, full joint detection and a RAKE receiver. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT(S) 
Figure 2 illustrates a simplified transmitter 26 and receiver 28 using an adaptive 
combination of joint detection (JD) and successive interference cancellation (SIC), "SIC- 
JD", in a TDD/CDMA communication system. In a typical system, a transmitter 26 is in 
each UE 14^ to I43 and multiple transmitting circuits 26 sending multiple conmiunications 
are in each base station 12^ to I25. A base station 12^ will typically require at least one 
transmitting circuit 26 for each actively communicating UE 14^ to I43. The SIC-JD receiver 
28 may be at a base station 12i, UEs 14^ to I43 or both. The SIC-JD receiver 28 receives 
communications from multiple transmitters 26 or transmitting circuits 26, 

Each transmitter 26 sends data over a wireless radio channel 30. A data generator 32 
in the transmitter 26 generates data to be communicated over a reference channel to a 
receiver 28. Reference data is assigned to one or multiple codes and/or time slots based on 
the conmiunications bandwidth requirements. A modulation and spreading device 34 
spreads the reference data and makes the spread reference data time-multiplexed with a 
training sequence in the appropriate assigned time slots and codes. The resulting sequence 
is referred to as a communication burst. The communication burst is modulated by a 
modulator 36 to radio frequency. An antenna 38 radiates the RF signal through the wireless 
radio channel 30 to an antenna 40 of the receiver 28. The type of modulation used for the 
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transmitted communication can be any of those known to those skilled in the art, such as 
direct phase shift keying (DPSK) or quadrature phase shift keying (QPSK). 

A typical communication burst 1 6 has a midamble 20, a guard period 1 8 and two data 
bursts 22, 24, as shown in Figure 3. The midamble 20 separates the two data bursts 22, 24 
5 and the guard period 18 separates the communication bursts to allow for the difference in 
arrival times of bursts transmitted from different transmitters. The two data bursts 22, 24 
contain the communication burst's data and are typically the same symbol length. The 
midamble contains a training sequence. 

The antenna 40 of the receiver 28 receives various radio frequency signals. The 
10 received signals are demodulated by a demodulator 42 to produce a baseband signal. The 
:i baseband signal is processed, such as by a channel estimation device 44 and a SIC-JD device 
46, in the time slots and with the appropriate codes assigned to the communication bursts 
W of the corresponding transmitters 26. The channel estimation device 44 uses the training 
^0 sequence component in the baseband signal to provide channel information, such as channel 
: i 5 impulse responses. The channel information is used by the SIC-JD device 46 to estimate the 
Jlj transmitted data of the received communication bursts as hard symbols. 
iZ The SIC-JD device 46 uses the channel information provided by the channel 

estimation device 44 and the known spreading codes used by the transmitters 26 to estimate 
the data of the various received communication bursts. Although SIC-JD is described in 
20 conjunction with a TDD/CDMA communication system, the same approach is applicable 
to other communication systems, such as CDMA. 

One approach to SIC-JD in a particular time slot in a TDD/CDMA communication 
system is illustrated in Figure 4. A number of communication bursts are superimposed on 
each other in the particular time slot, such as K communication bursts. The K bursts may 
25 be from K different transmitters. If certain transmitters are using multiple codes in the 
particular time slot, the K bursts may be from less than K transmitters. 
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Each data burst 22, 24 of the connniinication burst 16 has a predefined number of 
transmitted symbols, such as N^. Each symbol is transnaitted using a predetermined number 
of chips of the spreading code, which is the spreading factor (SF). In a typical TDD 
communication system, each base station 12^ to 12^ has an associated scrambling code 
5 mixed with its communicated data. The scrambling code distinguishes the base stations 
from one another. Typically, the scrambling code does not affect the spreading factor. 
Although the terms spreading code and factor are used hereafter, for systems using 
scrambling codes, the spreading code for the following is the combined scrambling and 
spreading codes. As a result, each data burst 22, 24 has x SF chips. After passing 
10 through a channel having an impulse response of W chips, each received burst has a length 
^3 of SF X + W - 1 , which is also represented as N^, chips. The code for a burst of the K 
' '4 bursts is represented by 

M Each burst is received at the receiver and can be represented by Equation 1 . 

Equation 1 

H 5 r^^^ is the received contribution of the k^ burst. A^^^ is the combined channel response, being 
Q an N^, X Ng matrix. Each column in A^^^ is a zero-padded version of the symbol response 
s^*"^ of the element of d*\ The symbol response s^^^ is the convolution of the estimated 
response h^^^ and spreading code C^^^ for the burst, d^^^ is the unknown data symbols 
transmitted in the burst. The estimated response for each k* burst, h^^\ has a length W chips 
20 and can be represent by Equation 2. 

Equation 2 
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Y^^-* reflects the transmitter gain and/or path loss. h_ represents the burst-specific fading 

channel response or for a group of bursts experiencing a similarly channel, /?_ represents 
the group-specific channel response. For uplink communications, each h^^^ as well as each 
Y^^^ and ^ are distinct. For the downlink, all of the bursts have the same ^ but each 

is different. If transmit diversity is used in the downlink, each y^^^ and n_ are distinct. 

The overall received vector from all K bursts sent over the wireless channel is per 
Equation 3. 

Equation 3 

n is a zero-mean noise vector. 

By combining the A^^^ for all data bursts into matrix A and all the unknown data for 
each burst d^^^ into matrix d. Equation 1 becomes Equation 4. 
r = A^ + n 

Equation 4 

SIC-JD determines the received power of each k^ burst. This determination may be 
based on apriori knowledge at the receiver 28, burst-specific channel estimation from a 
burst-specific training sequence, or a bank of matched filters. The K bursts are arranged in 
descending order based on the determined received power. 

Bursts having roughly the same power level, such as within a certain threshold, are 
grouped together and are arranged into G groups, 48. The G groups are arranged into 
descending order by their power, such as from group 1 to G with group 1 having the highest 
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received power. Figure 5 is an illustration of a SIC-JD device 46 performing SIC-JD based 
on the G groups. 

For the group with the highest received power, group 1, the symbol response matrix 
for only the bursts in group 1, A/^\ is determined. A/^^ contains only the symbol responses 
of the bursts in group 1 . The received vector, r, is modeled for group 1 as x/'\ As a result, 
Equation 4 becomes Equation 5 for group 1 . 

Equation 5 

d/^^ is the data in the bursts of group 1. Equation 5 addresses both the effects of inter 
symbol interference (ISI) and multiple access interference (MAI). As a result, the effects 
of the other groups, groups 2 to G, are ignored. 

The received vector, x/^\ is matched filtered to the symbol responses of the bursts 
in group 1 by a group 1 matched filter 66i, such as per Equation 6, 50. 

Equation 6 

^g^^^ is the matched filtered result. 

A joint detection is performed on group 1 by a group 1 joint detection device 68i to 

A (1) 

make a soft decision estimate of d g,soft , using the matched filtered result \ One JD 
approach is to compute the least-squares, zero-forcing, solution of Equation 7. 

Equation 7 
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A is the hermetian of ^ . Another JD approach is to compute the minimum mean 

6 6 

square error solution (MMSE) as per Equation 8. 

Equation 8 

I is the Identity matrix and is the standard deviation. 
5 One advantage to performing joint detection on only a group of bursts is that the 

complexity of analyzing a single group versus all the signals is reduced. 

Since Af^ and A^^^ are banded block Toeplitz matrices, the complexity in solving either 

ffl Equation 7 or 8 is reduced. Additionally, Cholesky decomposition may be employed with 
a negligible loss in performance. Cholesky decomposition performed on a large number of 

^||0 bursts is extremely complex. However, on a smaller group of users, Cholesky 
decomposition can be performed at a more reasonable complexity. 

J^^ A (1) 

J^' The soft decisions, d g^soft . are converted into hard decisions, d g^kard , by soft to hard 

decision block 10^ as the received data for group 1 , 54. To process the other weaker groups, 
the multiple access interference caused by group 1 onto the weaker groups is estimated by 
15 a group 1 interference construction block 12^ using Equation 9, 56. 

Equation 9 

a(]) 

r is the estimated contribution of group 1 to r. 
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For the next group 2, the estimated contribution of group 1 is removed from the 
received vector, x/^\ to produce x/', such as by a subtracter 14^, as per Equation 10, 58. 

Equation 10 

As a result, multiple access interference from group 1 is effectively canceled from the 
received signal. The next strongest group, group 2, is processed similarly using x/^ with 
group 2 matched filter 662, group 2 JD block 682, soft to hard decision block 7O2 and group 
2 interference construction block 722, 60. The constructed group 2 interference, , is 
subtracted, such as by subtractor 242, from the interference cancelled signal for group 2, 
^(2) _ -(2) =x^^\ 62. Using this procedure, each group is successively processed until the 

final group G. Since group G is the last group, the interference construction does not need 
to be performed. Accordingly, group G is only processed with group G matched filter 66^, 
group G JD block 68g and soft to hard decisions block VOq to recovery the hard symbols, 64. 

When SIC-JD is performed at a UE Uj, it may not be necessary to process all of the 
groups. If all of the bursts that the UE 14i is intended to receive are in the highest received 
power group or in higher received power groups, the UE 14i will only have to process the 
groups having its bursts. As a result, the processing required at the UE can be further 
reduced. Reduced processing at the UE 14i results in reduced power consumption and 
extended battery hfe. 

SIC-JD is less complex than a single-step ID due to the dimension x K • N, matrix 
being replaced with G JD stages of dimension N, x n; • N„ where i = 1 to G. n, is the number 
of bursts in the i* group. The complexity of JD is proportional to the square to cube of the 
number of bursts being jointly detected. 

An advantage of this approach is that a trade-off between computational complexity 
and performance can be achieved. If all of the bursts are placed in a single group, the 
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solution reduces to a JD problem. The single grouping can be achieved by either forcing all 
the bursts into one group or using a broad threshold. Alternately, if the groups contain only 
one signal or only one signal is received, the solution reduces to a SIC-LSE problem. Such 
a situation could result using a narrow threshold or forcing each burst into its own group, 
by hard limiting the group size. By selecting the thresholds, an optional tradeoff between 
performance and complexity can be achieved. 

Figures 6 to 12 are simulation results that compare the bit error rate (BER) 
performance of SIC-JD to full JD and RAKE-like receivers under various multi-path fading 
channel conditions. The parameters chosen are those of the 3GUTRA TDD CDMA system: 
SF = 61 and W = 57. Each TDD burst/time-slot is 2560 chips or 667 microseconds long. 
The bursts carry two data fields with Ns QPSK symbols each, a midamble field and a guard 
period. Each simulation is run over 1000 timeslots. In all cases the number of bursts, K is 
chosen to be 8 . All receivers are assumed to have exact knowledge of the channel response 
of each burst, which is used to perfectly rank and group the bursts. The channel response 
is assumed to be time-invariant over a time-slot, but successive time-slots experience 
uncorrelated channel responses. No channel coding was appUed in the simulation. The JD 
algorithm jointly detects all K bursts. The RAKE-like receiver was a bank of matched 

filters, /'^ = A"^" rj'^ , for an i* burst's code. The maximal ratio combiner (MRC) stage is 

implicit in these filters because they are matched to the entire symbol-response. 

The performance was simulated under fading channels with multi-path profiles 
defined by the ITU channel models, such as the Indoor A, Pedestrian A, Vehicular A 
models, and the 3GPP UTRA TDD Working Group 4 Case 1, Case 2 and Case 3 models. 
In Vehicular A and Case 2 channels, the SIC-JD suffered a degradation of up to 1 decibel 
(dB) as compared to the full JD in the 1% to 10% BER range. For all other channels, the 
SIC-JD performance was within 0.5 dB of that of the full JD. Since Vehicular A and Case 
2 represent the worst-case amongst all cases studied, only the performance plots are shown. 
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Amongst all channels simulated, Vehicular A and Case 2 have the largest delay spread. 
Vehicular A is a six tap model with relative delays of 0, 310, 710, 1090, 1730 and 2510 
nanoseconds and relative average powers of 0, -1, -9, -10, -15 and -20 decibels (dB). Case 
2 is a 3 tap model, all with the same average power and with relative delays of 0, 976 and 
1200 nanoseconds. 

Figures 6 and 7 compare the bit error rate (BER) vs. the chip-level signal to noise 
ratio (SNR) performance of the SIC-LSE receiver with the full JD and RAKE-like receivers 
under two multi-path fading channel conditions. The group size is forced to be 1, to form 
K groups, both, at the transmitter and receiver. The theoretical binary phase shift keying 
(BPSK) BER in an additive white gaussian noise (AWGN) channel that provides a lower 
bound to the BER is also shown. The BER is averaged over all bursts. Figure 6 represents 
the distinct channel case wherein each burst is assumed to pass through an independently 
fading channel but all channels have the same average power leading to the same average 
SNR. Thus, in this case, h^'\i=l---K are distinct while y'-'Ki =1---K are all equal. Such 

a situation exists in the uplink where the power control compensates for long-term fading 
and/or path-loss but not for short-term fading. At each time-slot, the bursts were arranged 
in power based upon the associated h}'^ ,i = h--K . Figure 7 shows similar plots for the 
common channel case. All bursts are assumed to pass through the same multi-path channel, 
i.e., = h--K and are all equal, but with different Y^'\i=h--K . The arechosen 

such that neighboring bursts have a power separation of 2 dB when arranged by power level. 
Such difference in power can exist, for instance, in the downhnk where the base station 12i 
applies different transmit gains to bursts targeted for different UEs 14i to Mg. Figures 6 and 
7 show that in the range of 1% to 10% bit error rate (BER), the SIC-LSE suffers a 
degradation of less than 1 dB as compared to the JD. This is often the range of interest for 
the uncoded BER (raw BER). The RAKE receiver exhibits significant degradation, since 
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it does not optimally handle the ISI. As the power differential between bursts increases, the 
performance of SIC-LSE improves. Depending upon the channel, a power separation of 1 
to 2 dB is sufficient to achieve a performance comparable to that of the full JD. 

Figures 8, 9, 10 and 11 compare the BER vs. SNR performance of the SIC-JD 
receiver with the full JD and RAKE-like receivers under two multi-path fading channels. 
The 8 codes are divided into 4 groups of 2 codes each at the transmitter and receiver. The 
BER is averaged over all bursts. Figures 8 and 9 represent the distinct channel case wherein 
different groups are assumed to pass through independently fading channels. However, all 
channels have the same average power leading to the same average SNR. All bursts within 
the same group are subjected to an identical channel response. In this case, Lg ,g = i---G 
are all distinct, but the channel responses, = 1, • • , for each burst in the group are 
equal, is the number of bursts in the g'^ group. This potentially represents a multi-code 

scenario on the uphnk, where each UE 14^ transmits two codes. The SIC-JD receiver 28 
groups the multi-codes associated with a single UE 14i into the same group, thus forming 
4 groups. Figures 10 and 1 1 represent the common channel case. All groups are assumed 

to pass through the same multi-path channel, i.e., L^g\g =l---«g are all equal, but with 

different 7^ - <? = 1 ' • ' <^ . The are chosen such that, when arranged according to power, 
neighboring groups have a power separation of 2 dB. This potentially represents a multi- 
code scenario on the downlink where the base station 12j transmits 2 codes per UE 14^. 
Figures 10 and 1 1 show a trend similar to that observed for the SIC-LSE shown in Figures 
8 and 9. SIC-JD has a performance comparable (within a dB) to the JD in the region of 1 % 
to 10% BER, which is the operating region of interest for the uncoded BER. Depending 
upon the channel, a power separation of 1 to 2 dB is sufficient to achieve a performance of 
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SIC-LSE comparable to that of the full JD. As shown, performance improves as the power 
separation between bursts increases. 

Figure 12 is similar to Figure 10, except that there are only two groups with 4 bursts 
each. As shown in Figure 12, SIC-JD has a performance comparable (within a dB) to JD 
in the region of 1% to 10% BER. 

The complexity of SIC-JD is less than full JD. The reduced complexity stems from 
the replacement of a single-step JD which is a dimension N ^xK -N ^viith G JD stages of 
dimension N ^xn^-N ^, i =l---G . Since, typically, JD involves a matrix inversion, whose 
complexity varies as the cube of the number of bursts, the overall complexity of the multi- 
stage JD can be significantly lower than that of the single-stage full JD. Furthermore, the 
complexity of the SIC part varies only hnearly with the number of bursts, hence it does not 
offset this complexity advantage significantly. For instance, the complexity of the G-1 
stages of interference cancellation can be derived as follows. Since successive column 

blocks of aI'^ are shifted versions of the first block and assuming that elements of d^g\^,i 
belong to 1 of 4 QPSK constellation points, the 4 • possible vectors can be computed that 
are needed in computing the product A'-^^t^l^.i- This step requires 

P G-l 

Aa-(SF+W -\) • y n million real operations per sec (MROPS). a =4 is the 

number of real operations per complex multiplication or multiply and accumulate (MAC). 
Rate is the number of times the SIC-JD is performed per second. With these 4 -n, vectors 

Rate 

aheady computed, the computation of 4'^'^ requires —• N , -{SF +W -1) •—t-Yj"'i 
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MROPS. The factor of — comes from the fact that only complex additions are involved. 

2 

Hence, only 2 real operations are required for each complex operation. It then follows that 
the complexity of G -1 stages of interference cancellation can be expressed by Equation 
11. 

Equation 1 1 

The complexity of converting soft to hard decisions is neghgible. 

There are several well-known techniques to solve the matrix inversion of JD. To 
illustrate the complexity, an approach using a very efficient approximate Cholesky factor 
algorithm with neghgible loss in performance as compared to the exact Cholesky factor 
algorithm was used. The same algorithm can be employed to solve group-wise JD. The 
complexity of the full JD and the SIC-JD for the 3GPP UTRA TDD system is shown in 
Table 1. Table 1 compares their complexity for various group sizes. It can be seen that as 
K increases or as the group size decreases the complexity advantage of the SIC-JD over the 
full JD increases. The complexity for group size 1, of the SIC-LSE, varies hnearly with K 
and is 33% that of the full JD for ii: = 16. Note that maximum number of bursts in the 
UTRA TDD system is 16. The complexity advantage of the SIC-JD over full JD will be 
even more pronounced when the exact Cholesky decomposition is employed. Exact 
Cholesky decomposition's complexity exhibits a stronger dependence on K, leading to more 
savings as the dimension of the JD is reduced via SIC-JD. 
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Total number of 
bursts 


Complexity of the SIC-JD expressed as a percentage of the 
complexity of the single-step JD of all bursts 




K groups of 
size 1 each 
(SIC-LSE) 


K / 2 groups of 
size 2 each 


K / 4 groups of 
size 4 each 


K / 8 groups of 
size 8 each 


8 


63% 


67% 


76% 


100% 


16 


33% 


36% 


41 % 


57% 



Table 1 

As shown in Table 1 , when the number and size of codes is made completely adaptive 
on an observation interval-by-observation interval basis, the SIC-JD provides savings, on 
average, over full JD. Since, on average, all bursts do not arrive at the receiver with equal 
power, depending upon the grouping threshold, the size of the groups will be less then the 
total number of arriving bursts. In addition, a reduction in peak complexity is also possible 
if the maximum allowed group size is hard-limited to be less than the maximum possible 
number of bursts. Such a scheme leads to some degradation in performance when the 
number of bursts arriving at the receiver with the roughly the same power exceeds the 
maximum allowed group size. Accordingly, SIC-JD provides a mechanism to trade-off 

performance with peak complexity or required peak processing power. 

* * * 
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